Fast and Exact Top-k Search for Random Walk with Restart

نویسندگان

  • Yasuhiro Fujiwara
  • Makoto Nakatsuji
  • Makoto Onizuka
  • Masaru Kitsuregawa
چکیده

Graphs are fundamental data structures and have been em-ployed for centuries to model real-world systems and phe-nomena. Random walk with restart (RWR) provides a goodproximity score between two nodes in a graph, and it hasbeen successfully used in many applications such as auto-matic image captioning, recommender systems, and link pre-diction. The goal of this work is to find nodes that have top-k highest proximities for a given node. Previous approachesto this problem find nodes efficiently at the expense of exact-ness. The main motivation of this paper is to answer, in theaffirmative, the question, ‘Is it possible to improve the searchtime without sacrificing the exactness?’. Our solution, K-dash, is based on two ideas: (1) It computes the proximityof a selected node efficiently by sparse matrices, and (2) Itskips unnecessary proximity computations when searchingfor the top-k nodes. Theoretical analyses show that K-dashguarantees result exactness. We perform comprehensive ex-periments to verify the efficiency of K-dash. The resultsshow that K-dash can find top-k nodes significantly fasterthan the previous approaches while it guarantees exactness.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reverse Top-k Search using Random Walk with Restart

With the increasing popularity of social networks, large volumes of graph data are becoming available. Large graphs are also derived by structure extraction from relational, text, or scientific data (e.g., relational tuple networks, citation graphs, ontology networks, protein-protein interaction graphs). Node-to-node proximity is the key building block for many graph-based applications that sea...

متن کامل

Fast Nearest Neighbors in Large and Composite Networks

We address the problem of k Nearest Neighbor (kNN) search in networks using a random walk based proximity measure. Our approach retrieves the exact top neighbors at query time without relying on off-line indexing or summaries of the entire network. This makes it suitable for very large networks, as well as for composite network overlays mixed at query time. We provide scalability and flexibilit...

متن کامل

Scalable Nearest Neighbors with Guarantees in Large and Composite Networks

We address the problem of k Nearest Neighbor (kNN) search in networks, according to a random walk proximity measure called Effective Importance. Our approach retrieves the exact top neighbors at query time without relying on off-line indexing or summaries of the entire network. This makes it suitable for very large dynamic networks, as well as for composite network overlays mixed at query time....

متن کامل

Fast Inbound Top-K Query for Random Walk with Restart

Random walk with restart (RWR) is widely recognized as one of the most important node proximity measures for graphs, as it captures the holistic graph structure and is robust to noise in the graph. In this paper, we study a novel query based on the RWR measure, called the inbound top-k (Ink) query. Given a query node q and a number k, the Ink query aims at retrieving k nodes in the graph that h...

متن کامل

Restart and Random Walk in Local Search for Maximum Vertex Weight Cliques with Evaluations in Clustering Aggregation

The Maximum Vertex Weight Clique (MVWC) problem is NP-hard and also important in realworld applications. In this paper we propose to use the restart and the random walk strategies to improve local search for MVWC. If a solution is revisited in some particular situation, the search will restart. In addition, when the local search has no other options except dropping vertices, it will use random ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • PVLDB

دوره 5  شماره 

صفحات  -

تاریخ انتشار 2012